# Multimodal vision model
Owlv2 Large Patch14 Ensemble
Apache-2.0
OWLv2 is a zero-shot text-conditioned object detection model that can detect objects in images through text queries.
Text-to-Image
Transformers

O
Thomasboosinger
1
0
Owlv2 Base Patch16
Apache-2.0
OWLv2 is a zero-shot text-conditioned object detection model that can detect and locate objects in images through text queries.
Text-to-Image
Transformers

O
vvmnnnkv
26
0
Owlv2 Large Patch14 Finetuned
Apache-2.0
OWLv2 is a zero-shot text-conditioned object detection model that can detect objects in images through text queries without requiring category-specific training data.
Text-to-Image
Transformers

O
google
1,434
4
Owlv2 Base Patch16 Finetuned
Apache-2.0
OWLv2 is a zero-shot text-conditioned object detection model that can retrieve objects in images through text queries.
Object Detection
Transformers

O
google
2,698
3
Owlvit Base Patch32
Apache-2.0
OWL-ViT is a zero-shot text-conditioned object detection model that can search for objects in images via text queries without requiring category-specific training data.
Text-to-Image
Transformers

O
google
764.95k
129
Featured Recommended AI Models